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Abstract 

What is the dominating mechanism of the price dynamics in financial systems is of great interest to 
scientists. The problem whether and how volatilities affect the price movement draws much attention. 
Although many efforts have been made, it remains challenging. Physicists usually apply the concepts 
and methods in statistical physics, such as temporal correlation functions, to study financial dynamics. 
However, the usual volatility-return correlation function, which is local in time, typically fluctuates around 
zero. Here we construct dynamic observables nonlocal in time to explore the volatility-return correlation, 
based on the empirical data of hundreds of individual stocks and 25 stock market indices in different 
countries. Strikingly, the correlation is discovered to be non-zero, with an amplitude of a few percent and 
a duration of over two weeks. This result provides compelling evidence that past volatilities nonlocal in 
time affect future returns. Further, we introduce an agent-based model with a novel mechanism, that is, 
the asymmetric trading preference in volatile and stable markets, to understand the microscopic origin of 
the volatility-ret urn correlation nonlocal in time. 


Author Summary 
Introduction 


Financial markets, as a kind of typical complex systems with many-body interactions, have drawn 
much attention of scientists. In recent years, for example, various concepts and methods in statistical 
physics have been applied and much progress has been achieved [l 22 . Following the trend towards 


quantitative analysis in finance, the efforts of scientists in different fields promote each other and deepen 
our understanding of financial systems j 1,6, [F7|[23 37 . 

From the perspective of physicists, a financial market is regarded as a dynamic system, and the 
price dynamics, i.e. the time evolution of stock prices, can be characterized by temporal correlation 
functions, which describe how one variable statistically changes with another. It is well-known that the 
price volatilities are long-range correlated in time, which is called volatility clustering. Many activities 
have been devoted to the study of the collective behaviors related to volatility clustering in stock 
markets PPPPP1 - However, our understanding on the movement of the price return itself is very much 
limited. The autocorrelating time of returns is extremely short, that is, on the order of minutes |3|j38|. As 
to higher-order time correlations, it is discovered that the return-volatility correlation is negative — in 
other words, past negative returns enhance future volatilities (4j 9 [[23j[40j(4T] . This is the so-called leverage 
effect. As far as we know, all stock markets in the world exhibit the leverage effect except for the Chinese 
stock market, which unexpectedly shows an anti-leverage effect, i.e., the correlation between past returns 
and future volatilities is positive [9 41 . Returns represent the price changes, and volatilities measure the 


fluctuations of the price movement. The leverage and anti-leverage effects characterize how price changes 
induce fluctuations. At this stage, one may ask what affects the return itself. It has been discovered that 


future returns can be predicted by the dividend-price ratio 42 43 , which is corroborated by subsequent 
studies. However, the predictive power of the dividend-price ratio is sensitive to the selection of the 
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sample period 44 45 
series 
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Recently, price extrema are found to be linked with peaks in the volume time 


Moreover, it is reported that massive data sources, such as Google Trends and Wikipedia, 
contain early signs of market moves. The argument is that these “big data” capture investors’ attempts to 
gather information before decisions are made 35,46 47 . These researches provide insight into the price 
dynamics. 

What is the dominating mechanism of the price dynamics is highly complicated. The problem how 
volatilities affect the price dynamics has drawn much attention. Although many efforts have been made, 
it remains enormously challenging. According to a hypothesis known as the volatility feedback effect, 
an anticipated increase in volatility would raise the required return in the future. To allow for higher 
future returns, the current stock price decreases 24 25 . Based on this hypothesis, various models, such 


as Generalized AutoRegressive Conditional Heteroskedasticity (GARCH) model 48 and Exponential 
GARCH (EGARCH) model 49 have been applied to examine the correlation between past volatilities 
and future returns, and the results are controversial. The correlation is discovered to be positive in 
some researches 24j|25|. while negative in others 49-51 . Often the coefficient linking past volatilities 
to future returns is statistically insignificant [261. On the other hand, the volatility-return correlation 
function can be used to characterize the correlation between past volatilities and future returns. If the 
hypothesis of the volatility feedback effect is valid, the volatility-return correlation function should be 
non-zero. However, it typically fluctuates around zero [4j[9]. Such a volatility-return correlation function 
can only characterize the correlation local in time. In fact, the scenario in financial markets may be more 
complicated. Interactions, and thus correlations could be nonlocal in time. 

In this study, we construct a class of dynamic observables nonlocal in time to explore the volatility- 
return correlation, based on the empirical data of hundreds of individual stocks in the New York and 
Shanghai stock exchanges, as well as 25 stock market indices in different countries. Strikingly, the 
correlation is discovered to be non-zero, with an amplitude of a few percent and a duration over two 
weeks. This result provides compelling evidence that past volatilities nonlocal in time affect future returns. 
Further, we introduce an agent-based model with a novel mechanism, that is, the asymmetric trading 
preference in volatile and stable markets, to understand the microscopic origin of the volatility-return 
correlation nonlocal in time. 


Materials 

We collect the daily closing prices of 200 individual stocks in the New York Stock Exchange (NYSE), 200 
individual stocks in the Shanghai Stock Exchange (SSE) and 25 stock market indices in different countries. 
The time periods of the individual stocks and stock market indices are presented in Table [l] All these 
data are obtained from Yahoo! Finance (finance.yahoo.com). To keep the time periods for all stocks 
exactly the same and as long as possible, we select 200 stocks in the SSE, most of which are large-cap 
stocks. For comparison, 200 stocks in the NYSE are collected. 
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Table 1. The time period, effective pair of time windows and maximum APq. The time 
period, effective pair of T\ and T 2 , and maximum APq for the individual stocks in the NYSE and SSE, as 
well as 18 stock indices. The volatility-return correlation nonlocal in time is positive for all these indices 
and stocks, except for the Australia and Japan indices, which exhibit a negative volatility-return 
correlation. For other 7 indices, nonzero A P(t) could not be detected for almost all pairs of T\ and T 2 . 
These indices include MERV (Argentina 1996-2012), BSESN (India 1997-2012), KLSE (Malaysia 
1993-2012), KJSE (Indonesia 1997-2011), OMXC20.CO (Denmark 2000-2012), OSEAX (Norway 
2001-2012) and FTSE (England 1984-2012), which are not listed in this table. 


Index 

Period 

Effective T\ 

Effective T 2 

max APq 

200 stocks in the NYSE 

1990-2006 

6-36 

45-250 

0.006 

200 stocks in the SSE 

1997-2007 

4-44 

95-250 

0.032 

BVSP (Brazil) 

1993-2012 

26-44 

60-105 

0.027 

GSPTSE (Canada) 

1977-2012 

19-40 

190-240 

0.029 

IPSA (Chile) 

2003-2012 

31-44 

190-220 

0.032 

Shanghai Index (China) 

1990-2009 

27-44 

80-105 

0.036 

S&P 500 (America) 

1950-2011 

29-36 

185-225 

0.012 

DAX (German) 

1959-2009 

28-44 

85-250 

0.015 

KOSPI (Korea) 

1997-2012 

26-44 

90-115 

0.027 

MXX (Mexico) 

1991-2012 

6-19 

65-140 

0.029 

NZ50 (New Zealand) 

2004-2012 

27-32 

100-120 

0.033 

IBEX (Spanish) 

1993-2012 

11-25 

105-225 

0.026 

OMX (Sweden) 

1998-2012 

7-17 

55-160 

0.043 

SSMI (Switzerland) 

1990-2012 

27-44 

80-110 

0.023 

FCHI (France) 

1990-2012 

18-19 

135-145 

0.021 

AEX (Holland) 

1982-2012 

21-23 

50-65 

0.023 

Shenzhen (China) 

1991-2009 

3-31 

90-250 

0.045 

DJI (America) 

1928-2011 

39-41 

200-225 

0.006 

AORD (Australia) 

1984-2012 

11-44 

45-250 

-0.032 

N225 (Japan) 

1984-2011 

34-44 

200-250 

-0.017 


Methods and Results 

Asymmetric conditional probability in volatile and stable markets 

To explore the volatility-return correlation in stock markets, we construct a class of observables, including 
conditional probabilities and correlation functions. We first discuss the conditional probabilities. 

The price of a financial index or individual stock at time t' is denoted by Y ( t and the logarithmic 
return is defined as R{t') = InY ( t') — In Y ( t' — 1). For comparison of different indices or stocks, we 
introduce the normalized return 

r(t') = [R(t')-(R(t'))}/a. (1) 

Here (• • •) represents the average over time t'. In other words, (R ( t’)) = EiLi ^(*)] / n ’ ls the average of the 
time series R (t'), where n denotes the total number of the data points of R (<'), and a = [(R 2 ) — ( R y] 1 / 2 
is the standard deviation of R(t'). There may be various definitions of volatility, a simplified one is 

v(t') = \r(t')\, (2) 

which measures the magnitude of the price fluctuation. 

One may compute temporal correlation functions to investigate the dynamic correlations. The usual 
volatility-return correlation function is defined as f{t) = (v ( t') • r (t' +1 )) with t > 0, and it characterizes 
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how the volatility at t! influences the return at t' + 1. However, this correlation function fluctuates around 
zero j4|9 . It is noteworthy that such a kind of f(t) is local in time, while interactions such as information 
exchanges in financial markets may be more complicated, leading to correlations nonlocal in time. 

To explore the correlations nonlocal in time, we first define an average volatility at t' over a past 
period of time T, 

( v (t')) T = (t' - i + 1). (3) 

i —1 

To evaluate whether the average fluctuation in a short time period T\ is strong or weak, we compare 
it with a background fluctuation, which is defined over a much longer period of time T 2 in the past. 
Therefore, we introduce the difference of the average volatilities in two different time windows, 


A v(t') = (v(t')) Ti -(v(t')) T2 , 


(4) 


with T 2 Ti. Ti and T 2 are called the short window and long window, respectively. When Av {t') > 0, 
the stock market in the time window Ti is volatile; otherwise, it is relatively stable. 

Next, we compute the conditional probability P + {t)\Av(t')> 0 i which is the probability of r(t' + t) > 0 
on the condition of Av(t r ) > 0. Here we consider only t > 0. Correspondingly, the conditional probability 
P + {t)\Av(t')<o is the probability of r(t' +1) > 0 for A v(t') < 0. We do not observe any r(t' +1 ) equal to 0 
in the normalized return series. Thus, the conditional probability of r(t' + t) < 0 is 1 — P + (t)\Av(t')>o 
and 1 — P + (t)\Av(t')< 0 i respectively. In a time series of returns, the total number of positive returns is 
generally different from that of negative ones. Let us denote the unconditional probability that the return 
is positive by Po(t), which is the percentage of the positive returns in all returns without any condition. 

The specific calculations for P + (t)\A v (t')> 0 i P + (t)\Av(t.')<o and Po{t) are described in SI Appendix If 
past volatilities and future returns do not correlate with each other, both P + {t)\Av(t')>o and P + (t)\Av{t')<o 
should be equal to Po(t). In other words, if P + {t)\Av(t')>o and P + {t)\Av(t’)<o are different from Po(t), i.e., 
if the conditional probability of returns is asymmetric in volatile and stable markets, there exists a non-zero 
volatility-return correlation and such a correlation is nonlocal in time. In this case, it can be proven that 
if p + (t)\ Avtt'p o > Po(t ), we have P + (t)\ A v(t')<o < p o(t), otherwise, we have P + {t)\ Av (t')<o > p o(t) (see 


SI Appendix). To describe the asymmetric conditional probability in volatile and stable markets, we 

A P(t) = P + (t)\Av(t')>0 - P + (t) |A„(t')<0- (5) 


introduce 


It is important that the probability difference AP(t) relies on A v(t'), thereby depending on the time 
windows T\ and T 2 . Even though the volatility-return correlation function local in time is zero, the 
nonlocal observable A P(t) can be non-zero. We call a pair of T\ and T 2 at which A P(t) is non-zero an 
effective pair. 

At the time windows Ti = 24 and T 2 = 205, for instance, we compute A P(t) for 200 stocks in the 
NYSE and take an average over these stocks. As displayed in Fig.[lja), the average A P(t) remains positive 
for over 20 days with an amplitude of 1 percent. The result indicates that the past volatilities nonlocal in 
time enhance the positive returns in the future. For comparison, three curves for A P(t) averaged over 
150, 100 and 50 randomly chosen stocks are also displayed. Within fluctuations, these three curves are 
consistent with that for A P(t) averaged over 200 stocks. We take the average over many stocks for the 
purpose of exploring the collective behavior of stocks. For a single stock, the price dynamics is much 
more complicated, and A P(t) fluctuates more strongly. Then we perform the same computation for 200 
stocks in the SSE at the time windows Ti = 10 and T 2 = 210. As displayed in Fig. EH A P(t) remains 
positive for about 40 days and the amplitude is about 5 percent. Compared with the results for the NYSE, 
the amplitude and duration of A P(t) for the SSE are respectively much larger and longer. The reason 
may be that the US stock market is highly developed, with large market size and diversified investment 
philosophies, while the Chinese stock market is emerging and of small market size, in which the investment 
philosophies of investors resemble each other. 
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Figure 1. The probability difference for the individual stocks. The probability difference A P(t) 
for (a) 200 individual stocks in the SSE and (b) 200 individual stocks in the NYSE. The black line shows 
A P(t) averaged over 200 stocks with error bars. The other lines represent A P(t) averaged over 150, 100, 
and 50 randomly chosen stocks. The time windows are T\ = 24 and T 2 = 205 for the NYSE, and Xi = 10 
and T 2 = 210 for the SSE. 


For the validation of our methods, each point of A P(t) in Fig. [I] is analyzed by performing Student’s 
f-test. In general, a p- value less than 0.01 is considered statistically significant. For the NYSE, the smallest 
p- value is in the order of 10 —12 , and all the p- values for 1 ^ t ^ 19 are less than 0.01. For the SSE, the 
p- values are even smaller, and less than 0.01 for 1 ^ t ^ 52. 

Actually the definition of volatility in Eq. <© is a simplified one. A more standard definition of 
volatility at t' is 


vx (t r ) 


-J2 r2 (^ , -* + l ) 


1/2 


( 6 ) 


i—1 


where m represents a relatively small time window, which may be set to be 5 days, i.e., the number of the 
trading days in a week. Given that these two definitions v(t' ) and Vi(t') may lead to different results in 
extreme volatility regimes, we consider both of them in our calculations. For iq (t r ). the average volatility 
at t' over a past period of time T is (vi{t')) T = ^ m+1 v\ (t 1 — i + 1)]/ (T — m + 1), with T ^ m. Thus, 

the difference of the average volatilities in two different time windows is Arq ( t') = (tq — (iq ( t ’)) To . 

For further comparison, one may also define the average volatility at t' over a past period of time T as 
(t '2 = [1 /T ■ Yli=i r 2 (^ — * + l)] 1 / 2 - Thus the difference of the average volatilities in two different 
time windows is Aiq (t') = (v? (t')) T — (V2 {V)) T ■ For Avi ( t ') and Av2 it') respectively, the probability 
difference is 

APi(i) = P + (t)\Av 1 (t' ) >o - P + (t)Uv^xo (7) 

and 

A P 2 (t) = P + (t )| A , 2 ( t ')>o - P + (t) U« 2 (t')<o- ( 8 ) 

For the NYSE and SSE respectively,we compute APi(t) and AP 2 (i), and take an average over individual 
stocks. The time windows are the same as those for A P(t) in Fig. [I] As displayed in Fig. [ 2 } the curves 
for A P(t), AP\{t) and AP 2 (t) overlap each other within fluctuations. In the following calculations, we 
mainly consider A P(t) and APi(t). 
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Figure 2. The probability differences for three definitions of volatility. The results are 
averaged over individual stocks. For the NYSE, the time windows are T\ = 24 and T 2 = 205. The result 
for the SSE is displayed in the inset, with the time windows T\ = 10 and T 2 = 210. 


Effective pairs of and T 2 

In the calculation of A P(t), the time windows Xi and T 2 are crucial. T\ represents the recent period of 
time, and investors measure the current fluctuation of prices according to the volatility averaged over 
T\. Thus, Ti should be relatively small. In our calculations, Ti ranges from 1 to 44 days. Here 44 is 
the number of trading days in two months. T 2 stands for the period of time in which one estimates the 
background of volatilities in the past. Theoretically, T 2 should be much larger than Tf. On the other 
hand, T 2 should not be arbitrarily large either: firstly, the memory of investors may not last very long; 
secondly, maybe more importantly, T 2 reflects the long-term fluctuation of stock markets, which should be 
reasonably fixed. In our calculations, T 2 ranges from 45 to 250 days. Here 250 is the number of trading 
days in a year. In fact, T 2 is more crucial to A P(t) than Ti. If T 2 were equal to the total length of the 
volatility series, (v ( t ')) T would be a constant, and A P(t) would become a local observable, which is just 
a volatility-return correlation function local in time but averaged over a Ti-day moving window. 

In Fig. [lja) and (b), we display A P(t) computed with a specific effective pair of T\ and T 2 . Actually, 
the effective pair of T\ and T 2 is not unique, and exists in a particular region. Therefore, we compute 
A P(t) with each pair of T\ and T 2 , and identify the effective pairs at which A P(t) is significantly non-zero. 
Since A P(t) needs to be computed in a large region of Ti and T 2 , it is inefficient to observe the behavior 
of A P(t) by eyes. Besides, due to the fluctuation of AP(t), the visual observation could be difficult in 
some cases. Therefore, we propose technical criteria to efficiently discriminate the non-zero A P(t). 

The schematic diagram of the criteria is displayed in Fig. [3] The criteria comprise four steps: 
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Figure 3. A schematic graph of the criteria for identifying non-zero AP. The red line 
represents AP'(f),which is the 3-point smoothed A P(t). ti is the day when the sign of A P'(t) changes 
for the first time. We define AP'(f) in the range of 1 < t < t\ — 1 as the first part, and that in the range 
of ti < t < t\ + r — 1 as the second part. The average absolute values for the first and second parts are 
denoted by AP\ and AP 2 , respectively. 

(1) A P(t) is smoothed with a 3-day moving window and the result is denoted by A P'(t). 

(2) Supposing A P'(t) changes sign for the first time at ti, we define A P'(t) in the range of 1 < t < t\ — 1 
as the first part, and that in the range of t\ < t < t\ — 1 + r as the second part. A non-zero A P'(t) would 
remain positive or negative in the first part, while fluctuate around zero in the second part. We set r to 
be 44, i.e., the number of the trading days in two months, which is long enough to confirm whether the 
second part of A P'(t) fluctuates around zero. 

(3) we calculate the average absolute values for the first and second parts of A P'(t), denoted by AP\ 
and AP 2 respectively. 

(4) For a non-zero A P(t), it has to be satisfied that (i)AP'(f) > AP 2 for 1 ^ t ^ to, and to > 10; 
(ii) each value of AP'(f) in the second part is smaller than APi. With these conditions, we sift out the 
non-zero A P(t) preliminarily. To measure how significantly AP(f) differs from zero, we calculate the 
average value of AP'(t) for 1 < t < to, which is denoted by AP 0 . Actually, the larger \APq\ is, the more 
significantly AP(t) differs from zero. The average of non-zero APo over different pairs of Ti and T 2 is 
denoted by APq. To consolidate our results, we identify those non-zero A P(t), which meet an additional 
requirement: (iii) |AP 0 | > |AP 0 |. AP 0 is set to 0 unless AP(t) satisfies all the requirements above. 

Now we compute AP(t) for the individual stocks in the NYSE with each pair of Ti and T 2 . AP(t) is 
averaged over 200 stocks, and the corresponding AP 0 is calculated. The landscape of AP 0 is displayed in 
Fig. Qa). The result indicates that the effective pairs of T\ and T 2 do exist in a particular region, and 
both Ti and T 2 are characteristics of the stock markets. In Fig. |4ja), the effective pairs of T\ and T 2 are 
basically adjacent to each other, suggesting that A P(t) locally is not very sensitive to Ti and T 2 . This is 
somehow expected, since A P(t) is computed from the volatilities averaged over T\ and T 2l and a little 
alteration in Ti or T 2 would not dramatically change AP(f). From this perspective, the gaps between the 
disconnected regions in Fig. |4^a) probably result from the fluctuations, especially taking into account the 
relatively small amplitude of non-zero A P(t) for the NYSE. 


















Figure 4. The landscape for the amplitude of A P(t). The amplitude AP 0 of A P{t) at different 
time windows T\ and T 2 for (a) 200 individual stocks in the NYSE and (b) 200 individual stocks in the 
SSE. Xi ranges from 1 day to 44 days, and the increment is 1 day. T 2 is from 45 to 250 days, with an 
increment of 5 days. The larger APq is, the more significantly A P(t) differs from zero. For A P(t) 
fluctuating around zero, APq = 0. 


Next, we perform a parallel analysis on the 200 stocks in the SSE, and the landscape for the amplitude 
of A P(t) is displayed in Fig. [4jb) . Similar with the result for the NYSE, a large region of non-zero A P(t) 
is observed for the SSE. At a single pair of Ti and T 2 , A P(t) averaged over 200 stocks would generally 
be non-zero, if A P(t) of some stocks is non-zero. Moreover, the region of non-zero A P(t) varies from 
one stock to another. Therefore, the average A P(t) of the individual stocks is non-zero in a relatively 
large region for both the NYSE and SSE. Additionally, as displayed in Fig. [4jb) , there exists only one 
connected region of non-zero A P(t) for the SSE, with the amplitude dwindling from the center to the 
edge. Compared with the result for the NYSE in Fig. Qa), the region of non-zero APq in Fig. |4jb) is 
broader, without gaps, and the value of APq is almost an order of magnitude larger. The reason may be 
traced back to the fact that the Chinese stock market is emerging, and less efficient than the US stock 
market. To further validate our methods, we perform Student’s t-test on each point of non-zero A P(t) in 
Fig- 0 A p -value less than 0.01 is considered statistically significant. At an effective pair of Ti and T 2 , 
A P(t) is confirmed to be non-zero, if all the p-values are less than 0.01 for 1 ^ t ^ 10. All non-zero A P(t) 
are confirmed except for a few ones at very small T\. 

We also compute APi(t) with different pairs of T\ and T 2 for the NYSE and SSE. Since m in Eq. ([6| 
is set to be 5, T\ should not be smaller than 5. The landscapes for the amplitude of APi(t) are almost 
the same as those for the amplitude of A P(t). 

Further, we compute A P(t) for the 25 stock market indices in different countries. The volatility-return 
correlation is positive for 16 indices, and the corresponding effective pairs of T\ and T 2 , as well as the 
maximum APq 7 are given in Table [T] For most of these indices, the maximum APq is over 2 percent, 
indicating that the correlation is rather prominent. In Fig. [5][a) , we display the regions of effective pairs 
of Ti and T 2 for 5 representative indices including the Brazil, Shanghai, Mexico, Spanish and S&P 500 
indices. For other 7 indices, nonzero A P(t) could not be detected for almost all pairs of T\ and T 2 . 
Exceptionally, the Australia and Japan indices exhibit a negative volatility-return correlation, i.e., the 
volatilities in a past period of time enhance the negative returns in future times. The effective pairs of 
Ti and T 2 , as well as the maximum APq for these two indices, are also presented in Table [I] We also 
compute APi(t) for the 5 representative indices, and the regions of effective pairs of T\ and T 2 are shown 
in Fig. §b). Compared with Fig. [5ja), the regions of the effective pairs of T\ and T 2 in Fig. j5jb) change 
slightly. The reason may be that the fluctuation of A P(t) and APi(i) for indices is stronger than that for 
the individual stocks. 





























































































































































































































































9 



Figure 5. The effective pairs of time windows for five representative indices. The probability 
difference (a) A P(t) and (b) A P\{t) for five stock market indices. Different colors represent the regions 
of effective pairs of Tj and X 2 for different indices. Specifically, navy stands for the Brazil Index, orange 
stands for the Shanghai Index, red stands for the Mexico Index, yellow stands for the Spanish Index and 
crimson stands for the S&P 500 Index. For clarity, we display only one color at the overlapping regions, 
given that these regions are small. Some scattered points are also omitted. 


To confirm that the nonlocal volatility-return correlation is indeed a nontrivial dynamic property of 
the stock markets, we randomly shuffle the time series of returns, i.e., randomize the time order of the 
returns, and perform the same calculation. In this case, A P(t) just fluctuates around zero. The result 
provides evidence that the correlation does originate from the interactions between past volatilities and 
future returns. 


Volatility-return correlation functions nonlocal in time 

Up to now, we have only concerned with the signs of A v{t') and r(t' + 1) in computing A P(t). Actually, 
the magnitudes of Av(t') and r(t' + t ) should also be important to both theoretical analysis and practical 
applications. Taking into account the magnitudes of Av(f') and r(t' + t ), we may explicitly construct a 
correlation function nonlocal in time to describe the volatility-return correlations, 

F(t) = (Av(t').r(t'+t)). (9) 

Both A P(t) and F(t) reflect the asymmetric behavior of r{t' +t) in volatile and stable markets, but A P(t) 
should be more fundamental. When A P(t) is non-zero, F(t) would be zero only if the contributions of 
r(f' + 1) > 0 and r(t' + t) < 0 happen to cancel each other. 

We compute F(t) with different pairs of Tf and T 2 for the 200 stocks in the NYSE and SSE respectively, 
and identify the non-zero ones with the same criteria for the non-zero A P(t). We also introduce AFq to 
describe how significantly F(t) differs from zero, of which the definition is the same as AP 0 for A P(t). 
F(t) is averaged over 200 stocks, and the landscape of the corresponding AF 0 is displayed in Fig. [6j 
The dynamic behavior of F(t) is qualitatively the same as that of A P(t) but quantitatively different. 
Both the amplitude of F(t) and the region of effective pairs of Tj and T 2 are smaller than those of 
AP(t). The fluctuation of F(t) is also somewhat stronger. Student’s t-test is performed on the non-zero 
F(t) and almost all of them are confirmed to be non-zero. We also compute Fj (t). which is defined as 
Fi(t) = (Avi(t') ■ r(t' + t)), with each pair of Xj and T 2 for the NYSE and SSE, and the results are almost 
the same as those for F(t). 
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SSE (200 stocks 1997-2007) 



Figure 6. The landscape for the amplitude of F(t). The amplitude AF 0 of F(t) at different time 
windows Ti and T 2 for (a) 200 individual stocks in the NYSE and (b) 200 individual stocks in the SSE. 


In fact, A P(t) can be expressed as the correlation function G(t) = ( sgn(Av(t ')) ■ sgn(r(t' +1))). Here 
sgn(x) represents the sign of x. G(t) behaves almost the same way as A P(t) does. Additionally, one may 
also define another volatility-return correlation function H(t) = ( sgn(Av(t ')) • r(t' + t)). Since only the 
magnitude of r(t' + t) is taken into consideration, H(t) is less fluctuating than F(t), whereas the result 
looks qualitatively similar. 

There have been many researches with different methods focusing on how volatilities affect returns in 
financial markets. A direct way is to calculate the usual volatility-return correlation function, which is 
defined as f(t) = ( v (t ') • r (t 1 + t)) with t > 0. However, the result fluctuates around zero [4,9 . In the 
past years, various GARCH-like models are applied to investigate the correlation between past volatilities 
and future returns. In these models, the future returns are assumed to be correlated with the past 
volatilities, and there are coefficients quantifying the correlation. The results are controversial. The 

More 


correlation is discovered to be positive in some researches 24 25 


but negative in others 49-51 

[26|. 


often, the coefficient linking past volatilities and future returns is statistically insignificant [26 . From 
our perspective, these studies only characterize the volatility-return correlation local in time. In our 
work, however, both A P(t) and F(t) are nonlocal in time, which are constructed based on the difference 
between the average volatilities in two different time windows. The correlation characterized by A P(t) 
and F(t) is more complicated and of higher-order. 


Agent-based model with asymmetric trading preference 

We construct an agent-based model to investigate the microscopic origin of the nonlocal volatility-return 
correlation. Agent-based modeling is a promising approach in complex systems, and has been applied 
successfully to study the fundamental properties in financial markets, such as the fat-tail distribution 
of returns, the long-range temporal correlation of volatilities, and the leverage and anti-leverage effects 


15 18 27 39 52-57 


The basic structure of our model is borrowed from the models in refs. 15 18 , which is built on 
agents’ daily trading, i.e., buying, selling and holding stocks. Since the information for investors is highly 
incomplete, an agent’s decision of buying, selling or holding is assumed to be random. Due to the lack of 
persistent intraday trading in the empirical trading data, we consider that only one trading decision is 
made by each agent in a single day. In our model, there are N agents and each agent only operates one 
share of stock each day. On day t, we denote the trading decision of agent i by 


1 buy 
Si(t) = { — 1 sell 
0 hold 


(10) 












































































































































































































































11 


The probability of buying, selling and holding decisions are denoted by Pbuy, Pseii and Phoid , respectively. 
Assuming that the price is determined by the difference between the demand and supply of the stock, we 
define the return R(t ) as 

N 

R(t) = Y,Si(t). ( 11 ) 

i-1 

Next, we introduce the investment horizon based on the fact that investors make decisions according 
to the previous market performance of different time horizons. It is found that the relative portion 7 j of 
investors with i days investment horizon follows a power-law decay 7 \ oc i -r? with 77 = 1.12. With the 
condition of = 1> 7 i is normalized to be 7 ; = i ~ v / X)f=i w here M is the maximum investment 

horizon. Considering different investment horizons of various agents, we introduce a weighted average 
return R'(t) to describe the integrated investment basis of all agents. Specifically, R'(t) is defined as 


M 


R’(t) = kJ2 


2=1 


Y* R & - j) 


3=0 


( 12 ) 


where k is a proportional coefficient. We set k = l/(X)fci so that \R'(t)\r 


= N=\R(t)\r 


According to ref. 18 , the maximum investment horizon M is set to 150. 


Herding behavior is an important collective behavior in financial markets. We define a herding degree 
D(t) to describe the clustering degree of the herding behavior, 


D(t+ 1) = \R\t)\/N. 


(13) 


On day t + 1, the average number of agents in each group is N • D(t + 1), and therefore we divide all 
agents into 1 /D(t + 1) groups. The agents in a same group make a same trading decision with the same 
trading probability. In ref. 15 , it is assumed that the probabilities of buying and selling are equal, 


i.e., Pbuy = Pseii = P , and p is a constant estimated to be 0.0154. Therefore the trading probability is 
Ptrade = Pbuy + Pseii = 2 p and the holding probability is Phoid = 1 — 2 p. In our model, the trading 
probability is also kept to be 2 p and remains constant during the dynamic evolution. 

Now we introduce a novel mechanism in our model, that is, the asymmetric trading preference in 
volatile and stable markets. In financial markets, the market behaviors of buying and selling are not 
always in balance 58 . Hence, Pbuy and P se u are not always equal to each other. They are affected by 


previous volatilities, and the more volatile the market is, the more Pbuy differs from P se u. 

For an agent with i days investment horizon, the average volatility over previous i days is taken into 
account, which is defined as 


1 


Vi(t) = ^ V ^-3- 

l 

3=1 


i). 


(14) 


Then we define the background volatility as Vm (t ), where M is the maximum investment horizon. On day 
t, the agent with i days investment horizon estimates the volatility of the market by comparing V t (t) with 
Vm (t) ■ Therefore, the integrated perspective of all agents on the recent market volatility is defined as 


m 


" M 

'52'YiViit) 

_ 2=1 


/v M (t). 


Thus, we define the probabilities of buying and selling as 


(15) 


Pbuy(t + 1 ) = p[c ■ £{t) + (1 - C)] 
Pseii (t + 1) = 2p — Pbuy (^ + 1) 


(16) 
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Here the parameter c measures the degree of agents’ asymmetric trading preference in volatile and stable 
markets. Compared with the model in ref. 18 , c is the only new parameter added in our model. We 


speculate that c can be determined from the trade and quote data of stock markets. Unfortunately, the 
data are currently not available to us. 

To judge from the amplitude of the volatility-return correlation, c should be a small number. Let 
us set c to be 1/80. The total number of the agents, N, is 10000. The returns of the initial 150 time 
steps are set to be random values following a standard Gaussian distribution. On day t , we randomly 
divide N agents into l/D(t) groups. The agents in a same group make a same trading decision with the 
same probability. After each agent makes his decision, the return R(t) can be computed. Repeating the 
procedure we produce 20000 data points of R(t ) in each simulation, and abandon the first 15000 data 
points for equilibration. Thus we obtain a sample with 5000 data points. 

After the time series R(t) generated from our model is normalized to r(t), we compute A P(t) with the 
time windows Tj = 3 and T 2 = 150. The result is averaged over 100 samples and displayed in Fig. 0a). 
A P(t) is significantly non-zero with an amplitude of 3 percent, lasting for about 20 days. For comparison, 
three curves for A P(t) averaged over 75, 50 and 25 randomly chosen samples are also displayed. Within 
fluctuations, these four curves are consistent with each other and in agreement with the empirical results. 



Figure 7. The simulation results of the agent-based model, (a) The probability difference 
A P(t) computed with the simulated returns at the time windows Tj = 3 and T 2 = 150. The parameter c 
is 1/80. The black line represents A P(t) averaged over 100 samples with error bars and the other lines 
stand for AP(f) averaged over 75, 50 and 25 randomly chosen samples. A P(t) for different values of c are 
displayed in the inset, with T\ = 3 and T 2 = 150. (b) The amplitude AP 0 of A P(t) at different time 
windows Tj and T 2 . Each A P(t) is averaged over 100 samples. The larger AP 0 is, the more significantly 
A P(t) differs from zero. For AP(t) fluctuating around zero, APq = 0. 


We also perform the simulation with c = 1/40 and c = 1/160, respectively, to investigate the dependence 
of A P(t) on c. As displayed in the inset of Fig. [7|a), the amplitude of A P(t) increases with c, i.e., the 
magnitude of c determines the amplitude of the volatility-return correlation. For c = 1/40, the amplitude 
of A Pit) is about 6 percent, which is in the order of that for the SSE and other markets with a strong 
volatility-return correlation. For c = 1/160, the amplitude of A P(t) is close to that for the S&P500 index, 
of which the volatility-return correlation is relatively weak. Therefore, with c ranging from 1/160 to 1/40, 
our model produces the volatility-return correlation consistent with the empirical results. Additionally, if 
c is negative, the volatility-return correlation will be negative, i.e., the sign of c fixes the correlation to be 
positive or negative. 

Next, we compute A P(t) with different pairs of T\ and T 2 , and determine the region of effective pairs 
of T\ and T 2 . A P(t) is averaged over 100 samples, and the landscape of APq is shown in Fig. 0 b). A 
single region with non-zero A P(t) is observed. For T 2 smaller than 120, for example, A P(t) is almost 
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zero. In other words, the effective pairs of T\ and T% exist in a particular region, which is consistent with 
the empirical results. 


Discussion 

We construct a class of dynamic observables nonlocal in time to explore the correlation between past 
volatilities and future returns in stock markets. Strikingly, the volatility-return correlation is discovered 
to be non-zero, with an amplitude of a few percent and a duration of over two weeks. The result indicates 
that past volatilities nonlocal in time affect future returns. Both the nonlocal dynamic observables A P(t) 
and F(t) rely on two time windows Tf and X 2 . The effective pairs of Tf and X 2 exist in a particular region, 
suggesting that both Tf and T 2 are the characteristics of the stock markets. 

Our results are robust for not only individual stocks but also stock market indices. The volatility-return 
correlation nonlocal in time is detected to be positive for individual stocks in the New York and Shanghai 
stock exchanges, as well as 16 stock indices. For other 7 indices, A P(t) fluctuates around zero. However, 
we suppose there may exist some higher-order correlations between volatilities and returns for these indices, 
which could be described by more complicated nonlocal observables. Exceptionally, other 2 indices exhibit 
a negative volatility-return correlation. 

To investigate the microscopic origin of the volatility-return correlation, we construct an agent-based 
model with a novel mechanism, that is, the asymmetric trading preference in volatile and stable markets. 
Accordingly, a parameter c is introduced to describe the degree of the asymmetric trading preference. 
The simulation results exhibit a positive correlation which is in agreement with the empirical ones. More 
importantly, the effective pairs of Tf and T 2 for simulation results exist in a particular region, which is 
also consistent with the empirical ones. Actually, our model can also produce a negative correlation by 
changing the sign of c. The results reveal that both the positive and negative correlations arise from the 
asymmetric trading preference in volatile and stable markets. In our model, the nonlocality arises from 
the interaction between the integrated perspective on the recent market volatility and the probabilities of 
buying and selling. 

Our results provide new insight into the price dynamics. Contrary to the assumptions in various 
models, the rise and fall of prices turn out to be far from random. To the best of our knowledge, the 
volatility-return correlation nonlocal in time is the only property concerning the control of the price 
dynamics, given that the autocorrelating time of returns is extremely short. This non-zero volatility-return 
correlation implies that there may exist higher-order correlations of returns, which deserves further 
investigation in the future, especially for those 7 indices with A P(t) fluctuating around zero. Furthermore, 
our results indicate that nonlocality is an intrinsic characteristic in the financial markets, which is more 
important than we thought before. Besides the volatility-return correlation in the stock markets, many 
other nonlocal correlations in financial systems are to be explored, which serves as our future agenda. 


Supporting Information 

SI Appendix 

Calculation for P + (t)\A v (t')>oi P + (t)\Av(t')<o and Po(t), and relation among them. 
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